15 

WE CLAIM: 

1 . A method for processing data arrays that collectively describe cy- 
clic behavior of at least one variable in several entities in a physical process; 

the method comprising: 

determining a first cycle in the cyclic behavior and dividing the first 
cycle into multiple time slots; 

determining multiple data arrays, each data array containing multi- 
ple data items such that each data item describes a variable of an entity in one 
time slot; 

for each of the several entities, determining a specific magnitude pa- 
rameter; 

scaling the data arrays between entities such that the specific mag- 
nitude parameters are suppressed; 

training a clustering system with a first plurality of the scaled data 
arrays to determine a set of cluster centers; and 

using the trained clustering system to cluster a second plurality of 
the scaled data arrays. 

2. A method according to claim 1, wherein the specific magnitude 
parameters are determined separately for each first cycle. 

3. A method according to claim 1, further comprising determining a 
second cycle that is a multiple of the first cycle and re-clustering the clustered 
data arrays in respect of the second cycle. 

4. A method according to claim 1, further comprising processing the 
suppressed specific magnitude parameter separately from the clustering sys- 
tem. 

5. A method according to claim 1, wherein the clustering system is 
an unsupervised clustering system. 

6. A method according to claim 5, further comprising initializing the 
unsupervised clustering system with a-priori seed values prior to said training. 



ters; 



7. A method according to claim 1 , further comprising: 

associating predetermined confidence interval with the cluster cen- 
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for each clustered data array of the second plurality, determining a 
best-matching cluster center and checking if the clustered data array is within 
said predetermined confidence interval of the best-matching cluster center; 

if yes, archiving an indicator of the best-matching cluster center and 
discarding the data array in question; or 

if not, archiving data items of the data array for those time slots in 
which the clustered data array is not said within said predetermined confidence 
interval. 

8. A method according to claim 7, wherein the confidence interval 
narrows progressively with increasing magnitude parameter of the variable 
described. 

9. A method according to claim 1, further comprising using the clus- 
tered second plurality of the scaled data arrays to detect anomalous situations. 

10. A method according to claim 1, further comprising using the 
clustered second plurality of the scaled data arrays to determine a pricing 
strategy. 

11. A method according to claim 1, wherein the scaled data arrays 
represent usage of services by various subscribers, and the method further 
comprises using the clustered second plurality of the scaled data arrays to se- 
lect candidate subscribers for service advertising. 

12. A method according to claim 1 , wherein said several entities are 
network resources and the method further comprises: 

using the scaled data arrays to determine a set of optimized operat- 
ing parameters for a network resource; and 

copying the optimized operating parameters to another network re- 
source. 

13. A data processing system for processing data arrays that collec- 
tively describe cyclic behavior of at least one variable in several entities in a 
physical process, wherein the cyclic behavior exhibits at least a repeating first 
cycle and each first cycle comprises multiple time slots; 

the data processing system comprising: 

an input routine for receiving multiple data arrays, each data array 
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containing multiple data items such that each data item describes a variable of 
an entity in one time slot; 

a magnitude-determination routine for determining a specific magni- 
tude parameter for each of the several entities; 

a scaling routine for scaling the data arrays between entities such 
that the specific magnitude parameters are suppressed; 

a training routine for training a clustering system with a first plurality 
of the scaled data arrays, to determine a set of cluster centers; and 

a clustering routine for clustering a second plurality of the scaled 
data arrays with the trained clustering system. 

14. A data processing system according to claim 13, wherein the 
magnitude-determination routine is operable to determine the specific magni- 
tude parameters separately for each first cycle. 

15. A data processing system according to claim 13, further com- 
prising a routine for processing the suppressed specific magnitude parameter 
separately from the clustering system. 

16. A data processing system according to claim 13, further com- 
prising an archival routine operable: 

to associate a predetermined confidence interval with the cluster 

centers; 

for each clustered data array of the second plurality, to determine a 
best-matching cluster center and to check if the clustered data array is within 
said predetermined confidence interval of the best-matching cluster center; 

if yes, to archive an indicator of the best-matching cluster center and 
to discard the data array in question; or 

if not, to archive data items of the data array for those time slots in 
which the clustered data array is not said within said predetermined confidence 
interval. 



